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Abstract 

Background: Neutral lipid storage is enhanced by nitrogen deprivation (ND) in numbers of green microalgal 
species. However, little is known about the metabolic pathways whose transcription levels are most significantly 
altered following ND in green microalgae, especially the nonmodel species. 

Results: To start gaining knowledge on this, we performed transcriptome profiling of the nonmodel green 
microalga Botryosphaerella sudeticus cells in response to ND. Transcriptome of B. sudeticus is de novo assembled 
based on millions of HiSEQ short sequence reads using CLC Genomics Workbench software. The resulting 
non-redundant ESTs are annotated based on the best hits generated from the BLASTX homology comparison 
against the "best" proteins in the model microalgae Chlamydomonas reinhardtii and Chlorella variabilis. By using a 
pathway-based approach according to KEGG databases, we show that ESTs encoding ribosomal proteins and 
photosynthetic functions are the most abundantly expressed ESTs in the rapidly growing B. sudeticus cells. We find 
that ESTs encoding photosynthetic function but not the ribosomal proteins are most drastically downregulated 
upon ND. Notably, ESTs encoding lipid metabolic pathways are not significantly upregulated. Further analyses 
indicate that chlorophyll content is markedly decreased by 3-fold and total lipid content is only slightly increased 
by 50%, consistent with the transcriptional profiling. On the other hand, carbon content and photosynthesis 
efficiency are only marginally decreased by 7% and 20%, respectively, indicating that photosynthesis is only slightly 
reduced upon drastic downregulation of photosynthetic ESTs and chlorophyll content upon ND. In addition, TAG 
content is found to be greatly increased by 50-fold, though total lipid content is only slightly increased by 1.5-fold. 

Conclusions: Taken together, our results suggest that light-harvesting proteins and chlorophylls are in excess in 
B. sudeticus. Degradation of excess photosynthesis proteins is most likely a mechanism for recycling of nitrogen-rich 
molecules to synthesize new proteins for preparation of gametogenesis and zygospore formation in adaptation and 
survival upon ND. Furthermore, our analyses indicate that TAG accumulation is largely attributed to the 
modification of other pre-existing lipid molecules, rather than de novo synthesis. We propose that this is likely an 
evolutionarily conserved mechanism in many green microalgae species. 
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Background 

Economic growth is often accompanied by the increase 
in energy consumption. Depletion of fossil fuel reserves 
and rising crude oil prices prompt renewed interest in 
algae biofuel research. Green microalgae are unicellular 
photosynthetic organisms capable of converting photon 
energy into chemical energy and assimilating carbon 
dioxide to form glucose. It has been proposed that 
microalgae farming could avoid competing with food 
crops for arable land and fresh water and the energy 
yield from algae per acre per year would be much higher 
than other crops [1-3]. In addition, it mitigates green- 
house gas emission. However, current microalgal species 
and cultivation methodologies are thought to hamper 
production of algae-based energy in an economically vi- 
able manner [1,4]. 

Many studies have focused on the enhancement of lipid 
contents in microalgae. Nitrogen deprivation (ND) is one 
of the most widely used methods to increase lipid storage 
in algae [4-10]. It is known that ND triggers game- 
togenesis and zygospore formation in the model green 
microalga C. reinhardtii [11]. Zygospores can survive un- 
der harsh conditions without nutrients for long period of 
time and ready to revive through germination when exter- 
nal nutrients become available. Storage molecules such as 
starch and fats (e.g., triacylglycerol or TAG) are the major 
source of energy essential for spore germination before 
they are capable of assimilating external nutrients. There- 
fore, enhancement of lipid storage following ND is at the 
cost of cell growth arrest and differentiation. 

Global transcriptional profiling of microalgal cells in 
response to ND using next-generation sequencing (or 
NGS) technologies allows identification of gene regula- 
tory networks involved in adaptation and survival [7,12]. 
Global transcriptional profiling indicates that alteration 
of lipid metabolic pathways is complex: enhancement of 
TAG accumulation could be a result of recycling of 
membrane lipids and de novo glycerolipid biosynthesis 
[7]. Comparative transcriptome analysis with other algal 
species may allow gaining insight into molecular mecha- 
nisms underlying the metabolic pathways involved in 
growth arrest and biosynthesis of storage molecules. 

Currently, a number of microalgal genome such as 
Chlamydomonas reinhardtii and Chlorella variabilis 
are fully sequenced and comprehensively annotated 
[13,14]. Additionally, assembly of genomes and trans- 
criptomes from the millions of short sequence reads gen- 
erated by NGS technologies has been greatly aided by the 
use of de Bruijn graph-based sequence-alignment algo- 
rithms [15-18]. Several green microalgal transcrip tomes 
have been assembled without reference genomes by using 
these methodologies [19-21]. However, most of these ana- 
lyses are primarily focused on gene discovery and meta- 
bolic pathway reconstitution. 



The green microalga Botryosphaerella sudeticus (UTAX2629) 
has been previously isolated from a culture of the 
hydrocarbon-rich slow-growing microalga Botryococcus 
braunii as a contaminant [22]. Phylogenetic analysis based 
on rDNA sequence indicates that B. sudeticus is more re- 
lated to Characiopodium hindakii rather than B. braunii 
[23]. Observation of gametes leads to the suggestion that 
B. sudeticus may have sexual life cycle [24]. We showed 
that the maximal cell density of B. sudeticus cultures was 
2-fold of that of C. reinhardtii under conditions of photo- 
trophic growth, suggesting a useful candidate for algal 
biomass production. Gametogenesis and lipid accumula- 
tion were induced by ND. To investigate transcriptional 
alteration of metabolic pathways following ND, we per- 
formed global transcriptional profiling of B. sudeticus cells 
in response to ND based on the transcriptome assembled 
without reference genome. By using a pathway-based ap- 
proach, we show that ESTs encoding photosynthetic func- 
tion and ribosomal proteins are the most abundant ESTs 
in rapidly growing B. sudeticus cells. Upon ND, photosyn- 
thetic function-encoding but not ribosomal protein- 
encoding ESTs are drastically downregulated, suggesting a 
programmed cellular response for differentiation. This 
transcriptional response to ND is found to be conserved 
in C. reinhardtii, possibly in most green microalgae. 

Results 

Gametogenesis and lipid accumulation are enhanced by 
nitrogen deprivation (ND) in 6. sudeticus 

B. sudeticus (UTEX2629) was often found to overgrown 
in cultures of B. braunii (UTEX572) (Sun and Liu, un- 
published data). It was partiy because of the slow growth 
rate of B, braunii. To investigate how well B. sudeticus 
grew, we tested its phototrophic growth in media of HS 
[25], BE [26], and 2x BE in shaking flask with bubbling of 
CO2 under constant light (i.e., 250 jimol photon m'^ sec'^). 
We compared the growth curve of B. sudeticus with that 
of C reinhardtii, a best-studied green microalga. The 
growth rate and the maximal cell density of B. sudeticus 
and C. reinhardtii were very similar in HS medium 
(Figure lA). On the other hand, though the growth rate 
of B. sudeticus remained to be similar as that of C. 
reinhardtii, the maximal cell density of B. sudeticus 
was increased by ~25% in IxEE medium and nearly by 
100% (or 2-fold) in 2x EE medium compared to that of C. 
reinhardtii. This result suggests that B. sudeticus is a use- 
ful strain for potential microalgal biomass production. 

We subsequently investigate if gametogenesis would be 
induced by ND. We noted that there were ~1% of gametes 
in log-phase growth culture (Figure IB; upper panel). On 
the other hand, ~10% of gametes were found three days 
after ND. Though we did not obtain the cell-cell fusion 
event in EM analysis (Data not shown), aggregation and 
precipitation of cells upon ND was apparent (Figure IE; 
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Figure 1 Characterization of growth, gametogenesis, and lipid content in B. sudeticus. (A) fi sudeticus culture reaches relatively high 

maximal cell density. Growth curves of 6. sudeticus (in red) and C reinliardtii (in black) in various media indicated are shown. X- and Y-axis 

indicate time (in day) and cell dry weight (CDW; in g/L), respectively. HS and BB stand for High Salt and Bold-modified Bristol media, respectively. 

Error bars represent the standard deviation of triplicate samples. (B) Gametogenesis is induced by ND in 6. sudeticus. Upper panel shows a 

gamete whose tip of flagella is denoted by an arrow. Bottom panel shows part of conical flasks in which sedimented cell aggregates are denoted 

by an arrow-head. Medium with nitrogen (+^2) or without nitrogen (-N2) is indicated. (C) Lipid accumulation is enhanced by ND in 6. sudeticus. 

Left panel shows the increased intensity of Nile red signal in cells after ND. X- and Y-axis indicate the time (in day) and Nile red signal level (in 

arbitrary units), respectively. Nile red signal level of cell samples are based on the median level derived from FAGS analysis as shown in the mid 

panel. Right panel shows the Nile red stained cells prior to (+N2) and after (-N2) ND. (D) Level of triacylglycerol (TAG) and squalene is increased 

in S. sudeticus following ND. Left panel shows the TLC analysis and right panel shows the GG-MS analysis (IVIS result is not shown) of total lipid in 

6. sudeticus prior to (+N2) and after (-N2) ND. (E) Gontents of nitrogen (N), carbon (G), and total lipid (TL). Bar-plots show the level of N, G, and TL 

against GDW in nitrogen-replete (+N2) and nitrogen-starved (-N2) cells. 
^ J 



bottom panel). Hence, we concluded that ND induced 
gametogenesis in B. sudeticus. 

To examine whether ND would enhance lipid storage, 
nitrogen-deprived (ND) cells were stained with a lipophilic 
dye Nile red and subjected to FACS analysis (Figure IC). 
Average level of Nile red signals in cells 3 days after ND 
was increased by 40-fold compared to that prior to ND. 
TLC and GC-MS analyses indicated that TAG and 



squalene (SQL) contents (i.e., % of CDW) were in- 
creased by ~50-fold and 2.5-fold compared to that of 
nitrogen-replete cells (i.e., TAG, 3.74±0.14% versus 
0.08±0.01% of CDW, p-value = 1.5e-06; SQL, 0.15±0.01 
versus 0.06±0.01% of CDW, p-value = 6.4e-05), re- 
spectively (Figure ID; Additional file 1: Table S8). Not- 
ably, total lipid of the early log-phase cells exhibited 
the relatively low level of TAG content. 
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Actively growing B. sudeticus cultures were maintained 
by subculturing or 8-fold dilution in fresh BB+N 
medium prior to exhaustion of nitrate in the medium 
(i.e., 3.8±0.3 mg/L nitrate at day 3 prior to dilution/ 
subculturing). We noted that the cell mass continued to 
increase under ND conditions (i.e., CDW from 0.14±0.03 
g/L at day 0 to 0.94±0.06 g/L at day 3 under ND condi- 
tions). To examine the alteration of nitrogen (N) and 
carbon (C) contents in cells after ND, we performed 
elemental analysis of N and C in cells. It was clear that 
N contents in the ND cells was greatly dropped by 2.5- 
fold compared to that of nitrogen-replete cells (i.e., 2.8± 
0.09% versus 7.14±0.16% of CDW, p-value = 2.16E-06) 
(Figure IE). On the other hand, the C content in the 
ND cells was only slightly reduced compared to that of 
nitrogen-replete cells (i.e., 42.7±0.31% versus 45.9±0.46% 
of CDW). This result suggests that carbon assimilation 
is not strongly affected upon ND for 3 days. While 
TAG content were drastically increased by 50-fold 
upon ND stress (i.e., 3.47±0.14% versus 0.08±0.01% of 
CDW; p-value = 1.5e-06), total lipid (TL) content in 
ND cells was only slightly increased by 1.54-fold com- 
pared to that of nitrogen-replete cells (i.e., 37.37±1.42% 
versus 24.21±2.71% of CDW). This result indicates that 
majority of the storage TAG molecules in ND cells are 
accumulated through the modification of pre-existing 
lipids, rather than de novo synthesis. 

De novo assembly and annotation of the B. sudeticus 
transcriptome 

For analysis of transcriptional profiling of B. sudeticus 
cells in response to ND, we first wanted to assemble the 
transcriptome without reference genome. For this reason, 
cDNA libraries were constructed from the ND and 
nitrogen-replete cells. A total of 52,000,000 paired-reads 
(90-bp in length) pooled from both growth conditions 
were subjected to de novo transcriptome assembly using 
the CLC Genomics Workbench software (see Methods). 
To this end, 56,000 non-redundant contigs/ scaffolds/ 
ESTs (ESTs were used hereafter) were generated. Length 
of ESTs ranged from 200 to 15,487 bps with a median of 
369 bps or average of 660 bps (Figure 2A). Approx. 90% 
of all reads in either growth condition was mapped to the 
ESTs. Absolute level of ESTs was normalized to RPKM 
(7?eads Per /<ilobase per Afillion mapped reads) ranged 
from 0.09 to 17,293 with a median of 2.21 (Figure 2B). 

B. sudeticus ESTs were annotated based on the best-hit 
proteins generated by sequence homology comparison 
using the Basic Local Alignment Search Tool BLASTX 
suite (http://blast.ncbi.nlm.nih.gov) against the "best" pro- 
teins in the model green microalgae C. reinhardtii and C. 
variabilis [13,14] with a cutoff expectation-value of lE-06. 
As a result, a total of 7,625 (13.6%) non-redundant ESTs 
in B. sudeticus were annotated, of which, 6,559 ESTs had 



best-hits in C. reinhardtii and 1,066 in C. variabilis 
[13,14] (Additional file 2: Table SI). 

Gene Ontology (GO) [27] and KEGG Ortholog (KO) 
[28] annotations that associated with the best-hit pro- 
teins in C. reinhardtii and C. variabilis were extracted 
for transcriptome analysis in B. sudeticus. Of a total of 
7,625 annotated ESTs, 4,860 were found to be associated 
with at least one of the 2,122 GO-terms (i.e., 433 terms 
in Biological Processes, 152 terms in Cellular Compo- 
nents, and 1,537 terms in Molecular Functions). GO 
terms associated with the highest number of ESTs in 
three categories were metabolism (322 ESTs; biological 
process), membrane (361 ESTs; cellular component), and 
ATP-binding (744 ESTs; molecular function). The top 10 
GO-terms in three categories were shown in Figure 2C 
(Additional file 3: Table S2). 

Out of 7,625 ESTs, 1,129 were found to be associated 
with at least one of the 114 KEGG metabolic pathways 
[28]. Top 30 KEGG metabolic pathways were shown in 
Figure 2D (Addftional file 4: Table S3). We took a 
pathway-based approach to study the coherence of ESTs 
absolute abundances in the rapidly growing cells and the 
coherence of ESTs abundance alterations (or ratios) 
upon ND in B. sudeticus (see below). 

Overview of the pathway-based transcriptome analysis 

Based on KEGG databases, we focused on the coherence 
analysis of the absolute abundance (i.e., steady-state level 
in nitrogen-replete cells) and relative abundance (i.e., ratio 
between levels in ND and nitrogen-replete cells) of a 
subset of the metabolic pathway-associated ESTs. We 
assumed that co-regulated basal or constitutive transcrip- 
tion of the pathway-associated ESTs would display a high 
consistency in EST levels in rapidly growing cells. Likewise, 
co-regulated transcriptional alteration in response to ND 
would exhibit a high coherence of the pathway-associated 
EST ratios between levels in ND and nitrogen-replete cells. 

We used the occurrence density of the pathway- 
associated ESTs in a range that contained the first 50%, 
middle 50%, or second 50% of the ESTs based on ranks to 
approximate the transcriptional coherence of a given 
metabolic pathways (see Methods). Coherence of the con- 
stitutive transcription and ND-induced transcription of 
the pathway-associated ESTs was determined based on 
ranks by the absolute abundance and relative abundance 
or ratio, respectively. A metabolic pathway was designated 
as a co-transcriptionally regulated (or coherent) pathway 
if the occurrence density of the pathway-associated ESTs 
in one of the three half ranges was 2-fold above back- 
ground and the p-value was less than 0.05 based on ranks 
by level (i.e., for basal transcription) or ratio (i.e., for tran- 
scriptional response). We also tested the occurrence dens- 
ity in the top 10% of the most abundant ESTs based on 
ranks by level or in the top (or bottom) 10% of the most 
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Figure 2 Characteristics of the B. sudeticus transcriptome. A total of 55 thousand ESTs is assembled based on 53 million paired-end reads 
(90 bp in length), of which, -7,500 ESTs were annotated. (A) Length distribution of all ESTs. X- and Y-axis indicate the binned EST length and the 
number of ESTs in each bin, respectively. (B) Level distribution of all ESTs. Levels are normalized to RPKM. X- and Y-axis indicate the binned EST 
level (in RPKM) and the number of ESTs in each bin, respectively. (C) List of GO-terms that are associated with ESTs annotations. Top 10 GO-terms 
in each category are shown. X-axis indicates the GO-terms and Y-axis indicates the number of ESTs associated with the GO-terms. BP, CC, and MF 
stand for biological process, cellular component, and molecular function, respectively. (D) List of KEGG metabolic pathways that are associated 
with ESTs annotations. Top 30 metabolic pathways are listed. The display is identical to (C). 
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upregulated (or downregulated) ESTs based on ranks by 
ratio (e.g., relative occurrence density >2-fold above bacl<- 
ground and p-value < 0.05) for coherently regulated meta- 
bolic pathways (see Methods). 

We proposed that coherent pathways were potentially 
co-transcriptionaUy regulated. Given that the magnitude of 
alteration in transcription levels following ND was a func- 
tion of time, magnitude of change in transcription levels 
would vary in cells sampled at different time point after 
stress. On the other hand, coherence of transcriptional 
response for metabolic pathways would be remained 
throughout the course of response. 

Coherent transcription of the ribosomal protein-encoding 
ESTs in 6. sudeticus 

Absolute level of the 7,625 annotated ESTs in B. sudeticus 
was approximated by RPKM (see Methods). We found 
that 16 out of top 30 most abundant ESTs encoded puta- 
tive ribosomal proteins, indicating that RP-encoding ESTs 
are enriched in the topmost expressed ESTs compared to 
background (i.e., 53.3% versus 1.29%, p-value < 1.33E-07) 
in actively growing B. sudeticus cells (Table 1). This result 
is consistent with a notion that protein synthesis is one of 
the most active functions in actively growing cells [29]. 

Cytosolic or eukaryotic ribosomal proteins (i.e., eRPs) 
are encoded by nuclear genes in green microalgae. On the 
other hand, chloroplast and mitochondrion or archaea 
ribosomal proteins (i.e., aRPs) are encoded by both the 
nuclear and chloroplast or mitochondrion genes, respect- 
ively. Transcriptional coordination would be relatively 
easy to achieve for a group of genes localized in a single 
genome compared to those in two genomes such as nu- 
clear and chloroplast or mitochondrial genomes. Hence, 
we hypothesized that the transcription consistency of the 
eRP-encoding ESTs would be high compared to that of 
aRP-encoding ESTs. To test this hypothesis, we first deter- 
mined the relative occurrence density in the 3 half-ranges 
(i.e., first, middle, and second 50%) of the eRP- and aRP- 
encoding ESTs (i.e., nuclear encoded aRP genes) based on 
the rank by abundance (Figures 3A and 3B). It was clear 
that the relative occurrence densities in the first and 
middle halves of the eRP-encoding ESTs were much 
higher than background (i.e., 56.8-fold and 36.1 -fold 
above background; p-value < 0.001), indicating that 75% 
of the eRP-encoding ESTs displayed a coordinated level 
of constitutive transcription. On the other hand, the rela- 
tive occurrence density only in the range of the first 50% 
of the aRP-encoding ESTs was higher than background 
(i.e., 7.2-fold above background; p-value < 0.001). 

We noted that the occurrence density of both eRP- and 
aRP-encoding ESTs was clearly enriched in the top 10% of 
the most abundant ESTs in actively growing B. sudeticus 
cells (i.e., 9.9-fold and 5.7-fold above background; 
p-value < 0.001), indicating that protein synthesis is 



highly active in both cytoplasm and chloroplast/ mito- 
chondrial to support growth of B. sudeticus cells. 

We subsequently examined the relative occurrence 
density of the eRP- and aRP-encoding ESTs based on 
ranks by ratio between levels in ND and nitrogen-replete 
cells. Occurrence density of the eRP-encoding ESTs was 
found to be enriched in the range containing the middle 
half and the second half of the eRP-encoding ESTs 
(i.e., 3.24-fold and 2.29-fold above background; p-value < 
0.001), suggesting that change in transcription levels of 
eRP-encoding ESTs is consistent during response to ND 
(Figure 3C). On the other hand, occurrence density in all 
3 ranges of aRP-encoding ESTs was not enriched. This re- 
sult supports our hypothesis that co-transcriptional regu- 
lation of genes localized in a single genome is relatively 
easy to achieve compared to those located in different ge- 
nomes. We noted that transcription of RP genes was not 
significantly downregulated in B. sudeticus cells 3 days 
after ND (Figure 3D). 

Energy metabolism-associated ESTs appear to be most 
abundant and highly coherent in actively growing B. 
sudeticus cells 

We examined the consistency in levels of ESTs associated 
with various metabolic pathways in actively growing B. 
sudeticus cells. To this end, 66 KEGG metabolic pathways 
were found to associate with at least 10 ESTs based on an- 
notation of the B. sudeticus transcriptome (Additional file 
5: Table S4). The eRP- or aRP-encoding ESTs were in- 
cluded as control. Relative occurrence density of individ- 
ual metabolic pathways in ranges containing the first, 
middle, and second halves of the pathway-associated ESTs 
based on ranks by level was determined (Figure 4A). Four- 
teen out of the 66 metabolic pathways exhibited high rela- 
tive occurrence density in one of the 3 ranges of the 
pathway-associated ESTs based on ranks by level (i.e., 2- 
fold above background; p-value < 0.05), 5 of which were 
found to be enriched in the top 10% of the most abundant 
ESTs (i.e., 2-fold above background; p-value < 0.05). 

Coherent transcription was found in 5 carbohydrate, 2 
lipid, 2 amino acid, and 5 energy metabolic pathways 
(Figure 4B). Of the 14 coherently transcribed pathways, 
5 were enriched in the top 10% of the most abundant 
ESTs: 2 involved in carbohydrate metabolisms (glycolysis/ 
gluconeogenesis and pentose phosphate pathway) and 3 
involved in energy metabolisms (carbon fixation, oxidative 
phosphorylation, and photosynthesis). Glycolysis is the 
process of converting glucose into pyruvate and genera- 
ting small amounts of ATP (energy) and NADH (reducing 
power) molecules; and the pentose phosphate pathway 
is a process of glucose turnover that produces NADPH 
and pentoses (for nucleic acid) [30]. Both were primary 
metabolism and essential for cell growth. Photosynthesis 
(i.e., light-dependent photosynthetic reaction) is the process 
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Table 1 List of the top 30 most abundant ESTs in B. 
sudeticus 



Table 1 List of the top 30 most abundant ESTs in B. 

sudeticus (Continued) 





£10*= 


Level 


Ratio 


Genome 


PID*^ 


Description'' 


1 


3556 


4.03 


-0.50 


Chlre4 


376403 




2 


318 


3.98 


-0.24 


Chlre4 


322197 


RP-S29e, RPS29; small 
subunit ribosomal 
protein S29e 


3 


5 


3.81 


0.20 


Chlre4 


395228 




4 


36952 


3.78 


0.16 


ChlNC64A 


28889 




5 


106 


3.70 


-0.04 


Chlre4 


155068 




6 


57 


3.69 


0.18 


Chlre4 


407233 


Glucan1,4-alpha- 
glucosidase. 


7 


8 


3.67 


-0.21 


Chlre4 


195592 


RP-S20e, RPS20; small 
subunit ribosomal 
protein S20e 


8 


520 


3.65 


-0.03 


Chlre4 


195131 


RP-L22e, RPL22; large 
subunit ribosomal 
protein L22e 


9 


127 


3.62 


0.05 


Chlre4 


418234 




10 


142 


3.61 
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of harvesting light energy and generating ATP and 
NADH which in turn utiUzed to reduce carbon dioxide 
to organic molecules in a process known as carbon fixation 
(or light-independent photosynthetic reaction). Oxidative 
phosphorylation is the process of transferring electrons to 
oxygen and generating ATP in mitochondria. Majority 
ATP in cells were generated through oxidative phosphoryl- 
ation [30]. Hence, we concluded that photosynthesis and 
energy metabolisms are the most active functions required 
for rapidly growth of B. sudeticus. 

Transcription of photosynthetic ESTs is most drastically 
downregulated upon ND in 6. sudeticus 

To investigate metabolic pathways that exhibited co- 
transcriptional alteration in response to ND, the relative 
occurrence density of the 66 metabolic pathways in the 3 
ranges (i.e., the first, middle, and second halves) of the 
pathway-associated ESTs was determined based on ranks 
by ratio between levels in ND and nitrogen-replete cells 
(Figure 5A). Five out of the 66 metabolic pathways such 
as glutathione metabolism, ATP synthesis, oxidative phos- 
phorylation, photosynthesis, and riboflavin metabolism 
exhibited coherent response to ND (Figure 5B). It was in- 
teresting that 4 of the 5 coherently responsive metabolic 
pathways displayed a consistency in constitutive trans- 
cription (see Figure 4B). On the other hand, of the 5 
metabolic pathways exhibiting coherent response, only 
photosynthesis showed to be downregulated by 43.8% 
based on the median of photosynthesis-associated ESTs 
following ND (i.e., p-value < 0.001) (Figure 5C). Median 
level of the remaining 4 pathways was hardly altered. 
Photosynthesis was the only pathway whose associ- 
ated ESTs were enriched in the top 10% of the most 
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(See figure on previous page.) 

Figure 3 Coherence of transcription levels in ribosomal protein-encoding ESTs in B. sudeticus. (A) Distribution of ESTs encoding eRPs and 
aRPs based on ranl< by level. Tine heat-map shows the EST occurrence that is binned by an average sliding window of 1 1 consecutive ranks and 
normalized to 1 for each pathway or component. The color key for percent levels is shown at the bottom. The box-plot indicates the minimum 
(min), first quartile (q1), median (mid), third quartile (q3), and maximum (max) of eRP and aRP EST levels by rank (B) Occurrence densities of eRP 
and aRP ESTs. Upper table shows the range of ESTs based on rank by level. Four ranges are shown: first 50% (R-F50), middle 50% (R-M50), and 
second 50%) (R-S50) of eRP- or aRP-ESTs and top 10%) (R-Top) of most abundant ESTs based on rank by level. Mid table shows the occurrence 
density of eRP- or aRP-ESTs. Bottom table shows the relative occurrence density in 3 half-ranges containing 50% of the eRP or aRP-ESTs. Coherent 
transcription or relative density in one of the 3 half-ranges is 2-fold above background and p-value is less than 0.05. Different levels of statistica 
significance are indicated as triple asterisk (***), double asterisk (**), and single asterisk (*) for p-value < 0.001, < 0.01, and < 0.05, respectively. 
(C) Distribution of eRP- and aRP-ESTs based on rank by ratio. Top panel shows the heat-map and box-plot of eRP- and aRP-EST distribution based 
on rank by ratio. Bottom table shows the relative occurrence density of eRP- and aRP-ESTs in 3 half-ranges and in top or bottom of the most 
differentially transcribed ESTs. Levels of statistical significance are indicated as (B). (D) Distribution of eRP- and aRP-EST ratios. Change of median 
ratios (AA/1) of eRP- and aRP-ESTs compared to background is indicated. 



downregulated ESTs, indicating that photosynthesis is the 
most drastically downregulated pathway in B. sudeticus 
following ND. 

Consistent with the downregulation of photosynthesis- 
associated ESTs, chlorophyll contents (i.e., chl a and b) in 
ND cells were also greatly reduced by 3.1-fold compared 
to that of nitrogen-replete cells (i.e., 0.26±0.03% versus 
0.81±0.11% of CDW, p-value = 0.001) (Figure 5D, upper 
panel). We subsequently determined the relative quantum 
yield or efficiency of photosystem II (i.e., Y(II) = (Fm'-F)/ 
Fm', see Methods), a measurement that is directly related 
to the quantum efficiency of carbon assimilation or photo- 
synthesis [31]. Interestingly, photosynthesis efficiency 
was only reduced by 20% upon ND compared to 
nitrogen-replete conditions (i.e., 0.43±0.01 versus 0. 
54±0.01) (Figure 5D, bottom panel), not markedly re- 
duced as seen in levels of chlorophyll content. Marginal 
reduction of quantum yield is consistent with the observa- 
tion that cell mass increase and carbon assimilation were 
only slightly affected by ND stress (see Figure IE). These 
results implied that photosynthetic components such as 
light harvesting proteins are in excess under optimal 
growth conditions. These proteins would be recycled for 
synthesis of new proteins required for gametogenesis and 
zygospore formation under ND conditions. 

We investigated pathway-associated individual ESTs 
whose levels were greatly upregulated upon ND. From the 
1,185 ESTs that were associated with 68 pathways, we 
found that 30 involved in 34 pathways were upregulated 
by 3-fold or greater (Additional file 6: Figure SI). Two 
lipid metabolism-specific enzymes, monoacylglycerol lip- 
ase (EC 3.1.1.23) and 3-oxoacyl-[acyl-carrier-protein] re- 
ductase (EC 1.1.1.100) were found. They might play a role 
in TAG accumulation upon ND. 

Abundant and coherent transcription of energy metabolic 
genes is conserved in actively growing 6. sudeticus and C. 
reinhardtii cells 

We wanted to investigate if the basal transcription of en- 
ergy metabolic genes was abundant and coherent in the 



model green microalga C. reinhardtii. For this reason, 
we obtained the transcription profiles of C reinhardtii 
generated using Illumina technologies in the studies by 
Miller et al. and Gonzales-Ballester et al. [7,12]. Based 
on the best hits in C. reinhardtii, there were 4,148 B. 
sudeticus ESTs whose best hits (or homologs) were avail- 
able in transcriptomic datasets of C. reinhardtii [7,12] 
(Additional file 7: Table S5). By using the common set of 
4,148 ESTs, we found that correlation between basal 
transcription profiles in B. sudeticus and C. reinhardtii 
was moderate (i.e., correlation coefficient = 0.47 ~ 0.49). 
The correlation between the two independent experi- 
ments in C. reinhardtii was relatively high (i.e., correl- 
ation coefficient = 0.76), indicating a good quality of the 
two datasets (Figure 6A). 

Based on the set of 4,148 ESTs, 53 metabolic pathways 
had 10 or more associated ESTs (i.e., due to the reduced 
number of ESTs, 13 pathways were filtered off from the 
list of 66 pathways based on the set of 7,625 ESTs; see 
Figures 4 and 5). We found 14 metabolic pathways and 
ribosomes (i.e., eRP and aRP) showed coherent transcription 
in rapidly growing log-phase B. sudeticus cells (Figure 6B; 
Additional file 8: Table S6; Additional file 9: Table S7). 
Though 5 of the 14 pathways were not previously shown 
when a set of 7,625 ESTs were used (see Figure 4B), the 
coherent pathways with high level of statistical significance 
(i.e., p-value < 0.001) remained unchanged. 

To ensure a fair comparison, we applied a common set 
of 4,148 ESTs in B. sudeticus and C. reinhardtii. The rela- 
tive occurrence density of the 53 metabolic pathways in 3 
ranges (i.e., the first, middle, and second halves of the 
pathway-associated ESTs) was determined based on rank 
by level using the common set of 4,148 ESTs in B. sudeticus 
and C. reinhardtii (Figure 6B; Additional file 6: Figure SI). 

Comparative analysis showed that eRP- and aRP- 
encoding ESTs were both abundant and coherent in B. 
sudeticus and C. reinhardtii (Figure 6C). Furthermore, we 
found that there were 10 metabolic pathways that were 
coherent in B. sudeticus and C. reinhardtii. Of those, 3, 2, 
and 5 involved in carbohydrate metabolisms, amino acid 
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Figure 4 Coherence of transcription levels in various metabolic pathways in rapidly growing 6. sudeticus cells. (A) Distribution of 
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metabolic pathways are indicated on the far left. Numbers of ESTs associated with the pathway is shown in parentheses, RP stands for ribosomal 
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metabolisms, and energy metabolisms, respectively. We 
noted that 3 out of the 5 energy metabolic pathways were 
abundant in B. sudeticus. On the other hand, all 5 energy 
metabolic pathways were abundant (i.e., in one of the two 
experiments) in C. reinhardtii. In addition, 2 carbohydrate 



and 1 amino acid metabolic pathways were coherent and 
abundant in C. reinhardtii. Coherent pathways tended to 
be abundant in C. reinhardtii, but not in B. sudeticus. 

We noted that 4 pathways were coherent in B. 
sudeticus but not in C. reinhardtii (Figure 6D). On the 
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Figure 6 Coherently transcribed metabolic pathways in rapid growing B. sudeticus or C. reinhardtii cells. (A) Correlation between EST 
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other hand, 8 were coherent in C. reinhardtii but not in 
B. sudeticus, 4 of which were also abundant. Interest- 
ingly, all coherent pathways were found to be involved 
in energy, carbohydrate, and amino acid metabolisms, 
but not in lipid metabolisms except for sterol biosyn- 
thesis in B. sudeticus. 

Transcriptional alteration of metabolic pathways but not 
individual ESTs is correlated In B. sudeticus and C. 
reinhardtii following ND 

Unlike the absolute level of ESTs, the relative level (or 
ratio between levels in ND and nitrogen-replete cells) of 
ESTs in B. sudeticus showed no correlation (i.e., correl- 
ation coefficient = 0.13) with that of C. reinhardtii fol- 
lowing ND based on the common set of 4,148 ESTs 
(Figure 7A). On the other hand, a moderate correlation 
(i.e., correlation coefficient = 0.42) was observed be- 
tween EST ratios in C. reinhardtii following ND and sul- 
fur deprivation, suggesting that the global transcriptional 



response to ND was, to some extent, species-specific in 
green microalgae. 

To compare the ND-induced transcription of ESTs asso- 
ciated with various metabolic pathways in B. sudeticus and 
C. reinhardtii, we determined the relative occurrence dens- 
ity in the 3 half-ranges of the pathway-associated ESTs 
based on rank by ratio (Figure 7B). We noted that median 
ratio of the pathway-associated ESTs in B. sudeticus was 
clearly correlated with that of C. reinhardtii following ND 
but not sulfur deprivation (Figure 7C). This result sug- 
gested that transcriptional response of metabolic pathways 
rather than individual ESTs was conserved in green 
microalgae B. sudeticus and C reinhardtii following ND. 

In B. sudeticus, only eRP-ESTs but not aRP-ESTs showed 
coherent response to ND (Figure 7D). On the other hand, 
both eRP- and aRP-ESTs exhibited coherent response in C. 
reinhardtii. We noted that none of them displayed a sig- 
nificant alteration in transcriptional levels (based on the 
media) following ND in B. sudeticus and C reinhardtii. 
Interestingly, though, the median level of eRP- or aRP- 
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Figure 7 Coherently responsive metabolic pathways in 6. sudeticus or C. reinhardtii following nutrient deprivation. (A) Corre ation 
between EST ratios in 6. sudeticus (i.e., Bs) and C. reinhardtii (i.e., Cr-M) following ND and C, reinhardtii (i.e., Cr-GB) following sulfur deprivation (or SD). 
Correlation coefficient based on EST ratios is shown. (B) Relative occurrence density of various pathway-associated ESTs based on rank by ratio. The 
display is identical to the right panel in Figure 5A. (C) Correlation between median ratios of various metabolic pathway-associated ESTs based on rank 
by ratio in 6. sudeticus and C. reinhardtii following nutrient deprivation, (D) Common consistently responsive pathways in 6, sudeticus and C. reinhardtii 
following ND. The display of the upper and bottom panels is identical to right panel in Figure 5B and Figure 5C, respectively. (E) Unique consistently 
responsive pathways in fi. sudeticus and C, reinhardtii following nutrient deprivation. The display is identical to (D), 



ESTs was significantly altered in C reinhardtii following 
sulfur deprivation. 

We found that there were 7 metabolic pathways ex- 
hibited consistent response (i.e., occurrence density in 1 of 
3 half-ranges > 2-fold; p-value < 0.05) in B. sudeticus 



following ND, of which, 3 (i.e., citrate cycle, ATP synthesis, 
and photosynthesis) were also found to showed coherent 
response in C. reinhardtii in response to ND. While the 
median level of citrate cycle and ATP biosynthesis was not 
significandy changed, the median level of photosynthesis 
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was obviously downregulated in B. sudeticus and C. 
reinhardtii following ND. In fact, it was also greatly 
downregulated in C. reinhardtii following sulfur de- 
privation. Consistent with it, aRP- but not eRP-ESTs were 
also downregulated in C. reinhardtii following sulfur 
deprivation. These results suggested that transcriptional 
downregulation of photosynthesis is a common response to 
nutrient deprivation. 

Among the coherent pathways specific to species or 
stresses (Figure 7E), folate biosynthesis was upregulated 
(p-value < 0.01) in B. sudeticus following ND and por- 
phyrin and chlorophyll metabolism was downregulated 
(p-value < 0.01) in C. reinhardtii following sulfur de- 
privation. Folate and its derivatives are cofactors of one- 
carbon metabolism that is required for the biosynthesis of 
nucleotides and amino acids and upregulation of folate 
biosynthesis was in agreement with biosynthesis of novel 
proteins for gametogenesis upon ND. Chlorophyll is crit- 
ical to photosynthesis. Hence, downregulation of porphy- 
rin and chlorophyll metabolism was consistent with the 
repression of photosynthetic transcription. 

Discussion 

Based on the pathway-based approach, we show that the 
absolute level of ESTs encoding RPs and energy metabolic 
pathways such as photosynthesis, carbon fixation, and 
oxidative phosphorylation are highly coherent and abun- 
dant in B. sudeticus (see Figure 4 and Figure 6) and C 
reinhardtii (see Figure 6) by using the analysis of EST oc- 
currence density based on rank by level, implying that 
protein synthesis and energy metabolisms are the most 
active functions in the rapidly growing B. sudeticus and C. 
reinhardtii cells, possibly in all actively growing green 
microalgal cells. Interestingly, many coherent carbohy- 
drate and amino acid metabolic pathways in C reinhardtii 
but not in B. sudeticus tended to be abundant, probably 
reflecting the difference between B. sudeticus photo- 
trophic growth and C. reinhardtii heterotrophic growth. 
We noted that almost all of the pathways whose coher- 
ence in EST steady-state levels occurred in the first 50% 
(or the upper half) of the pathway-associated ESTs. It im- 
plies that there is a tight regulation of the abundantly 
expressed transcripts in microalgal cells. It would save en- 
ergy for optimal function of complex metabolic pathways 
in microalgae. 

ND is one of the most widely used methods for en- 
hancement of storage lipids in green microalgae [4-10]. 
However, ND induces cell cycle arrest and sexual differen- 
tiation and spore formation for survival [11]. In this study, 
we show that photosynthesis is only metabolic pathway 
whose transcription is most downregulated in B. sudeticus 
following ND and C. reinhardtii following ND or sulfur 
deprivation (see Figure 7). In C. reinhardtii following sul- 
fur deprivation, porphyrin and chlorophyll metabolism 



and aRP-ESTs are also downregulated. In plant, it is 
known that photosynthetic genes are downregulated in 
infected or damaged leaf by biotic agents such as arthro- 
pods, fungi, bacteria and viral pathogens [32]. Addition- 
ally, accumulation of carbohydrate under sink limitation 
can lead downregulation of photosynthesis in plant [33]. 
In unicellular green microalgae, it is not known if 
downregulation of photosynthesis is pardy a result of 
carbohydrate (e.g., starch) accumulation in chloroplast. 

In this study, we show that, upon ND, the level of nitro- 
gen in cells is reduced by 2.5-fold. While transcription of 
photosynthetic genes is drastically reduced, chlorophyll 
level is also reduced by 3-fold. On the other hand, under 
ND conditions, cell mass still increases. Carbon content 
in ND cells is only decreased by 7% compared to that of 
nitrogen-replete cells. This is consistent with the ob- 
servation that photosynthesis efficiency is only reduced by 
20%. These data indicate that drastic downregulation of 
photosynthetic transcripts or chlorophylls is primarily for 
recycling of nitrogen-rich proteins. It implies that photo- 
synthetic proteins such as light-harvesting proteins are in 
excess in B. sudeticus, similar to large-sized antenna- 
bearing C. reinhardtii. It is found that cells with small- 
sized antenna in C. reinhardtii are more productive com- 
pared to that with large-sized antenna [34] . 

In yeast, genes encoding RPs and protein synthesis are 
downregulated in response to a number of stress factors 
including ND [35,36]. However, ribosome activity is not 
abolished. It is required for novel protein synthesis for 
preparation of sexual differentiation and zygospore for- 
mation. In mammalian systems, it has been shown that 
downregulation of protein synthesis is not mediated 
through dramatic repression of RP gene transcription but 
through reduced efficiency of translation [37]. It is pos- 
sible that translation efficiency is decreased upon ND in 
green microalgae. This would explain the upregulation of 
eRP-ESTs in C. reinhardtii following sulfur deprivation 
(see Figure 7D). Downregulation of photosynthesis in 
algae following ND may provide a pool of nitrogen-rich 
molecules for synthesis of novel proteins required for 
adaptation and survival [38]. Given that most green 
microalgae resemble shade leaves having large-sized an- 
tenna, recycling of excessive photosynthetic proteins may 
be not immediately affect the photosynthesis capacity and 
carbon assimilation upon ND. 

Transcriptional response to ND has been thor- 
oughly studied in diatoms [39-41]. In Phaeodactylum 
tricornutum, carbon assimilation is found to continue 
after ND [39]. In addition, TAG content but not total 
lipid content is greatly increased following ND [39]. 
Noticeable reduction of photosynthesis activity (e.g., 
20% of total activity) upon ND has also observed in 
diatom [42]. These results are very similar to what we 
have observed in B. sudeticus (in this study). 
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ND induces lipid accumulation in B. sudeticus (Figures IC 
and ID) and in C reinhardtii [4,6,7,9]. However, no ap- 
parent up-regulation of lipid metabolic pathways based on 
KEGG database [43] is observed following ND in B. 
sudeticus and C. reinhardtii. This is consistent with the 
notion that regulation of storage lipid accumulation is 
complex in cells following ND: remodeling of membrane 
lipids for gametogenesis and accumulation of storage 
lipids through de novo biosynthesis and membrane lipid 
recycling [7]. Alternatively, regulation of TAG biosyn- 
thesis may involve post-translational modification [44]. 
Comparative analysis of global transcriptional response to 
ND in other microalgal species should provide insight 
into underlying molecular mechanisms of storage lipid ac- 
cumulation following ND. Dissecting pathways respon- 
sible for lipid accumulation from those regulating sexual 
differentiation would allow construction of microalgal 
strains suitable for biomass and lipid production. 

Conclusions 

In this study, we utilized the commercial software CLC 
Genomics Workbench [16] to assemble the transcrip- 
tome without reference genome (see Methods). We 
show a clear correlation between transcription abun- 
dances of homologs in B. sudeticus and C reinhardtii, 
suggesting a decent assembly of B. sudeticus transcrip- 
tome without reference genome using commercial soft- 
ware (see Figures 6 and 7). It provides a way for 
biologists to perform transcriptome studies in nonmodel 
organisms (i.e., no reference genome sequences) without 
much bioinformatics skills. By using the pathway-based 
approach, without performing time-course experiments, 
we are able to identify a number of probable gene regu- 
latory networks (i.e., genes with coherent transcription 
in a pathway) that are involved in regulation of various 
metabolic pathways in rapidly growing or ND-induced 
B. sudeticus cells. Our result is useful for construction of 
motif-specific transcription regulatory networks when 
genome sequence is available. For transcriptome analysis 
involving a large numbers of samples such as in a time- 
course experiment with multiple repeat, it is economical 
to use DNA microarray-based technologies. Hence, de 
novo assembled transcriptome is also useful in designing 
DNA microarray. 

Methods 

Algal strains and culture manipulation 

The Botryosphaerella (or Botryococcus) sudeticus (UTEX2629) 
strain was obtained from the Culture Collection of Auto- 
trophic Organisms (http://ccala.butbn.cas.cz/) with a 
strain catalog number of CCALA780 and the C 
reinhardtii (UTEX89 or CC1009) strain was obtained 
from Chlamy Center (www.chlamy.org). CC1009 is a cell 
wall proficient and NITl^ and NIT2^ strain. The cells 



were cultivated in 1 L of HS medium [25] or Hold's modi- 
fied Bristol (BB) [26] medium in a 2.5 L low-form flask 
with shaldng at 100 rpm at 25°C under continuous illu- 
mination of ~250 mol photon m'^ sec '^. Culture was sup- 
plied with 2% CO2 through bubbling. BB medium without 
sodium nitrate (or BB-N) was used as nitrogen-depleted 
medium. 

Cell density of cultures was determined using cell dry 
weight (CDW) in g/L. Approx. 50 ml cell culture were 
harvested by filtration using the glass fiber filter GF/A 
(Whatman/GE Healthcare, Kent, UK) and dried in oven 
at 80°C overnight. CDW was measured in triplicate using 
the AG204 balance (Mettler-Toledo Inc., Columbus, OH). 

The ratio of flagellates in cultures was determined by 
counting a total of 500 individual cells (e.g., flagellate or 
non-flagellate) in at least five independent fields in tripli- 
cate using Zeiss Axiovert 200 M microscope (Carl Zeiss 
AG, Oberkochen, Germany). 

To assess flocculation, shaken flasks (Corning Incor- 
porated, Corning, NY, USA) were brought to standstill 
for 30 min to score for flocculation or sedimentation of 
cells at the bottom of flasks. 

Log- phase culture in BB+N medium at the CDW of -1.2 
g/L is diluted by 8-fold in fresh BB+N and BB-N medium 
and cultivated for 3 days as nitrogen-replete and nitrogen- 
depleted cultures, respectively. 

Elemental analyses 

To determine nitrogen and carbon content of cells, 
approx. 3mg dry cell powder were subjected to elemen- 
tal analysis using the the Vario Micro Cube (Elementar 
Analysensysteme GmbH, Hanau, Germany) in the Elem- 
ental Analysis Laboratory, Department of Chemistry, 
National University of Singapore. To determine nitrate 
in medium, 2 ml culture supernatant was subjected to 
OD measurement using the Spectroquant Test Kit for 
Nitrate (Merck KGaA, Darmstadt, Germany) according 
to the manufacturer's instruction. 

Analyses of chlorophyll fluorescence and PAM- 
fluorescence of photosynthesis 

To determine chlorophyll content, cells from 2 ml culture 
were harvested by centrifugation. The resulting cells were 
resuspended in 96% ethanol and broken in the glass-bead 
beater FastPrep homogenizer (MP Biomedicals, Solon, OH, 
USA). After incubation with ethanol for 2 h at 4°C, cell 
debris was cleared by centrifugation. The resulting super- 
natant was subjected to OD measurement at the wave- 
length of 645 nm and 663 nm, with the 96% ethanol as 
blank. Total chlorophyll content (i.e., chl a and b) was esti- 
mated using a formula C (mg/L) = 20.2OD645+8.05D663 
[45] and was converted to % of CDW. 

To determine photosynthesis efficiency, cells were 
subjected to PAM-fluorescence analysis using the fluorometer 
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Imaging-PAM (Heinz Walz GmbH, Effeltrich, Germany) 
according to the manufacturers instruction. Briefly, 
the Fm' and F were determined at the arctic light of 
250 |imol photon m' sec' to mimic the growth condi- 
tions. The quantum yield was based on the formula Y(II) = 
(Fm'-F)//Fm' [46]. 

Fluorescence microscopic analyses and fluorescence- 
activated cell sorting (FACS) analysis 

To examine the lipid content in B. sudeticus cells, 4 |il 
of 0.25 mg/ml Nile red in acetone was directiy added to 1 
ml of fresh culture. After incubation at RT in dark for 15 
min, fluorescence signals in B. sudeticus cells were 
examined using the Zeiss Axiovert 200 M (Carl Zeiss 
AG, Oberkochen, Germany) with a Zeiss EC Plane- 
Neofluar 40x/0.75 objective lens. The images were cap- 
tured by a CoolSNAP HQ monochrome digital camera 
(Roper Scientific, Ottobrunn, Germany) and processed using 
MetaMorph software (Molecular Devices, Sunnyvale, CA). 

B. sudeticus cells were fixed with 10% formaldehyde (w/v) 
at RT for 15 min. The cells were washed twice with IxPBS 
and stained by Nile red at the final concentration of 1 |ig/ml 
for 15 min in dark. In microscopic analysis, Nile red 
stained cells were examined using the Zeiss Axiovert 200 
M (Carl Zeiss AG, Oberkochen, Germany) with A Zeiss 
filter cube no. IS {EX BP 546/12, BS FT 580, EM LP 590). 
In FACS analysis, Nile red stained cells were subjected to 
the analysis of BD FACS Calibur flow cytometer (Becton 
Dickinson Biosciences, San Jose, CA). 

TLC and GC-MS analysis 

Total lipid was exacted with methanol/ chloroform (2:1 
by volume) solution and was quantified via gravimetric 
analysis after evaporation of solvents. Then it was 
dissolved in chloroform at a concentration of 0.1 mg/|il. 
Equal amount of lipid was loaded on silica TLC plate 
(60F254, Merck Corporate, Whitehouse Station, NJ, USA) 
and developed in hexane/ diethyl ether/ acetic acid 
(35:15:0.1 by volume). TAG (tri-oleic acid (C18:l, [cis-9]) 
glyceride) and FFA (oleic acid (C18:l, [cis-9])) (Sigma- Al- 
drich, St. Louis, MO) was used as standard. Lipid profile 
on TLC plate was developed using iodine vapor. TAG and 
FAA contents were estimated through densitometric ana- 
lysis of TLC profile. Hydrocarbons were exacted in equal 
amounts of total lipids with hexane. Hydrocarbons were 
analyzed using Shimadzu GC/MS-QP2010 Plus system 
(Shimadzu, Kyoto, Japan) equipped with a HP-5ms Ultra 
Inert column (30 m x 0.25 mm x 0.25 |am, Agilent Tech- 
nologies, Santa Clara, CA). 

Construction of cDNA libraries for next-generation 
sequencing analysis 

Total RNA was extracted from B. sudeticus cells using 
TRIzol" Plus RNA Purification System (Invitrogen-Life 



Technologies Co., Carlsbad, CA, USA) according to 
manufacturer's protocol. Appr. 4 [^g of the resulting total 
RNA was used for synthesis of cDNA using the TruSeq 
RNA Sample Prep Kit (Invitrogen-Life Technologies 
Co.) according to manufacturer's instruction including 
synthesis of first and second strands cDNA, end repair, 
3'-end adenylation, adapter ligation, fragment enrich- 
ment (e.g., -260 bps in length), and library validation, 
quantification, and quality assessment with a bioanalyzer 
(Agilent Technologies; Santa Clara, CA, USA). The li- 
braries are sequenced using the lUumina HiSeq 2000 Se- 
quencer (BGI, Shenzhen, China). 

De novo assembly of lllumina short sequence reads 

Over 2 Giga-base clean paired-end reads (90 bps in 
length/ read) from each library were generated using 
HiSeq2000 technology. A total of 56 million reads from 
both growth conditions were pooled and subjected to de 
novo assembly using the CLC Genomics Workbench soft- 
ware (CLC Bio, Swansea, UK). As a result, 56 thousand 
nonredundant contigs/ scaffolds/ ESTs ranging from 200 
to 15,500 bps in length were obtained. More than 90% of 
total reads from each growth condition were mapped back 
to the 56 thousand ESTs, -99% of which were found in 
both conditions. Read hits per EST were normalized to 
RPKM (Reads Per Kilobase per Million mapped reads) for 
estimation of ESTs transcription levels. 

Annotation of the B. sudeticus transcriptome 

All nonredundant ESTs were subjected to sequence hom- 
ology comparison using the Basic Local Aliment Search 
Tool BLASTX suit against the "best" proteins in the model 
green microalgae C. reinhardtii and C. variabilis (genome. 
jgi-ps£org/Chlre4 and genome.jgi-ps£org/ChlNC64A_l). A 
total of 6,559 ESTs showed to have a best hit (i.e., expect- 
ation value < lE-06) in the C. reinhardtii "best" proteins. 
Among the remaining ESTs, 1,066 showed to have a best 
hit (i.e., expectation value < lE-06) in the C. variabilis 
"best" proteins. To this end, 7,625 ESTs (13.6%) in B. 
sudeticus were annotated based on the best hits in C. 
reinhardtii and C. variabilis (Additional file 2 Table SI). Of 
7,625 ESTs, 4,860 were found to be associated with at least 
one of the Gene-ontology (GO) terms and 1,129 were asso- 
ciated with at least one of the Kyoto Encyclopedia of Genes 
and Genomes (KEGG) functions. 

Analysis of relative occurrence density of the pathway- 
associated ESTs 

Based on EST annotations, 66 (or 53) metabolic path- 
ways were found to have 10 or more pathway-associated 
ESTs by using the set of 7,625 (or 4,148) ESTs (Add- 
itional file 5: Table S4). Average occurrence density (or 
background) of the pathway-associated ESTs is calcu- 
lated as the ratio between numbers of pathway- 
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associated ESTs and all ESTs tested. Occurrence density 
in the range containing first 50%, middle 50%, or second 
50% of the pathway-associated ESTs is calculated as the 
ratio between 50% of the pathway-associated ESTs and 
all ESTs within the respective range. Relative occurrence 
density in a particular range is the ratio between the oc- 
currence density in that range and the background. 

Coherence of the constitutive transcription levels (or 
ND-induced transcription ratios) of various metabolic 
pathway-associated ESTs is defined as those whose rela- 
tive occurrence density in one of the 3 half-ranges based 
on rank by level (or ratio) is 2-fold above background 
and p-value < 0.05. We assessed whether the coherent 
pathways were the most abundant (or most differently 
transcribed) ones in B. sudeticus by using the relative oc- 
currence density in the top 10% of the most abundant 
(or most differentially transcribed) ESTs based on rank 
by level (or ratio) (i.e., >2-fold above background; p- 
value < 0.05). 

The distribution of pathway-associated ESTs based on 
ranks (i.e., level or ratio) in heat-map display is binned 
by an average sliding window of 11 consecutive ranks. 
The entire distribution of the ESTs in each pathway is 
normalized to 1 or 100%. 

Comparative analysis between B. sudeticus and C. 
reinhardtii 

Of 7,625 annotated ESTs in B. sudeticus, 6,559 have a 
best hit in the C. reinhardtii "best" proteins. Based on 
the transcriptome analysis by Miller et al. [7] and 
Gonzalez-Ballester et al. [12], 4,148 out of 6,559 ESTs 
were available for comparisons. Therefore, coherence of 
transcriptions of 53 metabolic pathways was determined 
based on the set of 4,148 ESTs (Additional file 8: 
Table S6). 

Statistical analysis 

Binomial test was used to determine the statistical sig- 
nificance of enrichment for a number of pathway associ- 
ated ESTs (i.e., observed) that are present within a range 
of ranked ESTs (i.e., trials). Average occurrence (or back- 
ground level) of a pathway associated ESTs is the ratio 
between numbers of the pathway associated ESTs and all 
ESTs tested. 

T-test was used to determine the statistical significance 
of level changes of pathway associated ESTs compared 
to all ESTs tested (or background). P-values are 
subjected to Bonferroni correction according to numbers 
of individual pathways tested against background. 

The raw HiSEQ2000 pair-end sequencing data used in 
this study are available at the NCBI's Sequence Read 
Achieve (http://www.ncbi.nlm.nih.gov/sra) with the SRA 
Study accession number, SRA026047. 
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